Using Co-occurrence, Augmented Restrictions, and C-E WordNet for Chinese-English Cross-Language Information Retrieval at CLEF 2001
Abstract
This paper reports NTU's work on the bilingual-retrieval task at CLEF 2001. We compared the effectiveness of five models for Chinese-English cross-language information retrieval. Model 1 used co-occurrence information in the target language to disambiguate translation equivalents; Model 2 augmented the original queries with restriction terms to constrain how query terms are used in the target language; Model 3 used a Chinese-English WordNet to translate queries; Model 4 combined Model 3 with Model 2; and Model 5 merged the queries constructed by Models 2 and 3.
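To make the idea behind Model 1 concrete, here is a minimal Python sketch of co-occurrence-based translation selection. It is an illustration only, not the authors' implementation: the abstract does not specify the scoring function, so the sketch assumes that the best combination of translations is the one with the highest summed pairwise co-occurrence count in a target-language corpus. The dictionary `CANDIDATES`, the table `COOCCURRENCE`, and the function names are all hypothetical, and the counts are toy values.

```python
from itertools import combinations, product

# Hypothetical bilingual dictionary: each Chinese query term maps to its
# candidate English translation equivalents (toy data, not from the paper).
CANDIDATES = {
    "作業": ["operation", "homework"],
    "系統": ["system", "scheme"],
}

# Hypothetical co-occurrence counts from a target-language (English) corpus.
# Keys are unordered word pairs; the numbers are made up for illustration.
COOCCURRENCE = {
    frozenset(("operation", "system")): 120,
    frozenset(("homework", "system")): 3,
    frozenset(("operation", "scheme")): 5,
    frozenset(("homework", "scheme")): 1,
}


def cooccurrence(a, b):
    """Return the corpus co-occurrence count of two target words (0 if unseen)."""
    return COOCCURRENCE.get(frozenset((a, b)), 0)


def select_translations(source_terms):
    """Pick one translation per source term so that the chosen words
    co-occur with each other as often as possible (assumed scoring)."""
    candidate_lists = [CANDIDATES[term] for term in source_terms]
    best_combo, best_score = (), -1
    # Exhaustive search over every combination; fine for short queries.
    for combo in product(*candidate_lists):
        score = sum(cooccurrence(a, b) for a, b in combinations(combo, 2))
        if score > best_score:
            best_combo, best_score = combo, score
    return list(best_combo)


print(select_translations(["作業", "系統"]))
```

The exhaustive search grows exponentially with query length, so a real system would more likely use a greedy or windowed selection; the sketch favors clarity over efficiency.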
Similar resources
NTU at CLEF 2001: Chinese-English Cross-Lingual Information Retrieval
This paper reports the work of NTU on the bilingual-retrieval task at CLEF 2001. We proposed five models. Model 1 used co-occurrence information to disambiguate translation equivalents; Model 2 augmented the original queries with restriction terms; Model 3 used C-E WordNet to translate queries; Model 4 combined Model 3 with Model 2; Model 5 merged the queries constructed by Models 2 and 3. The best on...
Trans-EZ at NTCIR-2: Synset Co-occurrence Method for English-Chinese Cross-Lingual Information Retrieval
In this paper, a new method for English-Chinese cross-lingual information retrieval is proposed and evaluated in the NTCIR-II project. We use bilingual resources and contextual information to deal with word sense disambiguation (WSD) and translation disambiguation for query translation. An English-Chinese WordNet and a synset co-occurrence model are adopted to solve the problem of word sense...
Exploiting the LDC Chinese-English Bilingual Wordlist for Cross Language Information Retrieval
We investigated using the LDC English/Chinese bilingual wordlists for English-Chinese cross language retrieval. It is shown that the Chinese-to-English wordlist can be considered as both a phrase and word dictionary, and is preferable to the English-to-Chinese version in terms of phrase translation and word translation selection. Additional techniques such as frequency-based term selection, tra...
Disambiguation and Unknown Term Translation in Cross Language Information Retrieval
In this paper we present a report on our participation in the CLEF 2007 Chinese-English ad hoc bilingual track. We discuss a disambiguation strategy which employs a modified co-occurrence model to determine the most appropriate translation for a given query. This strategy is used alongside a pattern-based translation extraction method which addresses the ‘unknown term’ translation problem. Expe...
Resolving Translation Ambiguity using Monolingual Corpora. A Report on Clairvoyance CLEF-2002 Experiments
Choosing the correct target words is a difficult problem for machine translation. In cross-language information retrieval, this problem of choice is mitigated since more than one translation alternative can be retained in the target query. Between choosing just one word as a translation and keeping all the possible translations for each source word, one can apply a range of filtering techniques...